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Abstract 

For a sub-Riemannian manifold provided with a smooth volume, we relate the small time 
asymptotics of the heat kernel at a point y of the cut locus from x with roughly "how much" y 
is conjugate to x. This is done under the hypothesis that all minimizers connecting x to y are 
strongly normal, i.e. all pieces of the trajectory are not abnormal. Our result is a refinement 
of the one of Leandre At logpt(x, y) — ¥ —d(x, y) for t — > 0, in which only the leading exponen- 
tial term is detected. Our results are obtained by extending an idea of Molchanov from the 
Riemannian to the sub-Riemannian case, and some details we get appear to be new even in 
the Riemannian context. These results permit us to obtain properties of the sub-Riemannian 
distance starting from those of the heat kernel and vice versa. For the Grushin plane endowed 
with the Euclidean volume we get the expansion p t (x,y) ~ i~ 5 / 4 exp(— d 2 (x, y)/At) where y is 
reached from a Riemannian point x by a minimizing geodesic which is conjugate at y. 

1 Introduction 

The heat kernel on sub-Riemannian manifolds has been an object of attention starting from the 
late 70s [13l [13 UHl [HI [20l [25l [3H [371 SOI ESI El], as have the geodesies and cut and conjugate 
loci of such manifolds [21 [3j El El El [Ml EHl E21 EI]- In this paper, we provide a general approach 
to relate the sub-Riemannian distance to the small time asymptotics of the heat kernel at the cut 
locus, at least in the case when there are no abnormal minimizers to the relevant point in the cut 
locus. 

The problem of relating the sub-Riemannian distance to the heat-kernel is an old problem (see 
for instance ^ H3 HBl |2U |22l |23 EQl Eg Eg |Sl ISDl In the following we recall some of 

the most relevant results. Let M be a n-dimensional smooth manifold provided with a complete 
sub-Riemannian structure, inducing a distance d, and also provided with a smooth volume fj,. Let 
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number 239748, by the ANR Project GCM, program "Blanche", project number NT09-504490 and by the DIGITEO 
project CONGEO. 
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Pt(x,y) the heat kernel of the sub-Riemannian heat equation dt<p = A{p, where A is the sub- 
Riemannian Laplacian defined as the divergence of the horizontal gradient. In particular A could 
be the sum of the squares of a choice of vector fields defining the sub-Riemannian distance (possibly 
with a first order term belonging to the distribution). 

• On the diagonal. For some constant C > (depending on the sub-Riemannian structure 
and x, we have 

Pt (x,x) = C+ t Q// t) ' ■ (1) 



This result is due to Ben Arous and Leandre |19j . Here Q is the Hausdorff dimension of the 
sub-Riemannian manifold at x (see also |13j). 

Off diagonal and off cut locus. Fix x 7^ y. If y is not in the cut locus of x and there are 
no abnormals from x to y, then for some constant C > (depending on the sub-Riemannian 
structure, x, and y), one has 

Pt {x,y)= C ±°^e-^y^. 



This result is due to Ben Arous [18]. See also Taylor 
In any point of the space including the cut locus. 

\\mAt\ogp t {x,y) = -d 2 {x,y). (2) 



This result is due to Leandre [3E1 ES] (see also Taylor [5l] ) . It is very general but is rougher 
than the one of Ben Arous. Roughly speaking it says that both on and off the cut locus, the 
leading term for t — > has the form e~ d2 ( x ' y ^ 4:t . 

These results hold in particular in the Riemannian case. In that case we have Q = n and formula 
((2]) is the celebrated Varadhan formula obtained in |55| . 

In this paper we give a finer result with respect to the one of Leandre. We show that if y belongs 
to the cut locus of x and all minimizers connecting x and y are strongly normal (a minimizer is said 
to be strongly normal if every piece of it is not abnormal) then the rate of decay of pt(x, y) depends, 
roughly, on "how conjugate" x and y are, along the minimal geodesies connecting them. Intuitively, 
the more conjugate they are, the slower the decay. These results include Riemannian manifolds as 
a special case, for which they are completely general, since there are no abnormal minimizers in 
Riemannian geometry. Some details of the explicit relationship between the heat kernel asymptotics 
and the conjugacy of the minimal geodesies appears to be new even in the Riemannian context. 
Our results are also completely general for certain classes of sub-Riemannian geometries for which 
it is known there are no abnormals, such as contact manifolds and CR-manifolds. For a discussion 
of the presence of strictly abnormal minimizers in sub-Riemannian geometry one can see |27j . 
Our main result is Theorem 25 in Section [5j which relates the heat kernel asymptotics of pt(x, y) 



with what we call the hinged energy function 



h x , y (z) = ^(d 2 (x,z) + d 2 (z,y)), (3) 
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on the set of midpoints of all minimizing geodesies connecting x to y. To avoid overly complicated 
notation, we state here the following corollary which explains what happens in the case when the 
first terms of the Taylor expansion of h x , y have a simple expression. 

Corollary 1. Let M be an n- dimensional complete sub-Riemannian manifold provided with a 
smooth volume fx and let pt be the heat kernel of the sub-Riemannian heat equation. Given distinct 
x,y G M let h Xt y(z) be the hinged energy function. Assume that there is only one optimal geodesic 
joining x to y and that it is strongly normal, and let zq be the midpoint of the geodesic. 

Then h x ^ y {z) is smooth in a neighborhood of zq and attains its minimum at zq. Moreover, if 
there exists a coordinate system (zi, . . . , z n ) around zq such that we have the expansion 

h x , y (z) = ±d 2 (x, y) + zl mi +... + z 2m " + o(\ Zl \ 2m + ... + |z„| 2m "), 

for some integers 1 < mi < <m% < • • • < m n then for some constant C > ( depending on the 
sub-Riemannian structure, x, and y), one has 

C + o(l) ( d 2 (x,y) \ 

From an analysis of the relation between the expansion of h xy and the conjugacy of x and y 
we get 

Corollary 2. Let M be an n-dimensional complete sub-Riemannian manifold provided with a 
smooth volume fi and let pt be the heat kernel of the sub-Riemannian heat equation. Let x and y 
be distinct and assume that every optimal geodesic joining x to y is strongly normal. 
Then there exist positive constants C{, and to (depending on M, x, and y) such that 

(i) %-- d2 ^ v)tu < y) < ^mf d2{x ^^ foro<t< t . 

(ii) If x and y are conjugate along at least one minimal geodesic connecting them, then 
Pt(x, y) > tin/ £ (m e-*™'« for < t < t . 

(Hi) If x and y are not conjugate along any minimal geodesic joining them, then 

Pt(x,y) = ^^e- d2 ^, forO<t<t . 

Note that (iii) shows that the result of Ben Arous holds not only off the cut locus, but also 
on the cut locus if x and y are not conjugate. 

In the corollaries above the concept of sub-Riemannian manifold is quite general. It includes 
Riemannian manifolds and even sub-Riemannian manifolds which are rank-varying (see Sections 
[2] and Appendix [A] for the precise definition) . The estimates (i) and (iii) were already known in 
Riemannian geometry (see |33| and |43j respectively) while (ii) appears to be new even in the 
Riemannian context. 

The sub-Riemannian heat equation is intended with respect to the sub-Riemannian Laplacian 
which is defined as the divergence of the sub-Riemannian gradient. Here the divergence is computed 
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with respect to a smooth volume. In the equiregular case (see Definition |4j) the most natural 
volume is Popp's volume, introduced by Montgomery in his book [H]. The hypothesis that the 
sub-Laplacian is computed with respect to a smooth volume is also essential. For rank-varying 
sub-Riemannian structures or for sub-Riemannian structure which are not equiregular, one could 
be tempted to define a sub-Laplacian containing diverging terms with the Popp volume (which is 
also diverging). This approach is possible. However it provides completely different results with 
respect to those presented in this paper. See for instance [22] for this approach in the case of the 
Grushin and Martinet structures. 

In addition to these general bounds on the decay of pt(x,y) our approach provides a technique 
for computing the heat kernel asymptotics in concrete situations, subject, of course, to one's ability 
to determine explicit information about the minimal geodesies from x to y and the behavior of h X y 
near their midpoints (which is related to the conjugacy of the minimal geodesies). 

Conversely, these results allow us to realize the old idea of getting properties of the sub- 
Riemannian distance from those of the heat kernel (see for instance [61 HI] ) • 

The hypothesis that all optimal geodesies connecting x to y are strongly normal is essential. 
In the case in which x is reached by y along an abnormal minimizer, it is not clear how to mea- 
sure how much y is conjugate to x since abnormal extremals are not included in the exponential 
mapping and are in a sense isolated. The analysis of the heat kernel asymptotics in the presence 
of abnormal minimizers is an extremely difficult problem (also because of the lack of information 
about properties of the sub-Riemannian distance) and its study goes beyond the purpose of this 
paper. 

Remark. Notice that in our approach we start from the sub-Riemannian structure (M,D,g), 
then we define an intrinsic volume, and finally we build the Laplace operator naturally associated 
with these data. This operator is by construction symmetric, negative and has the form A = 
X? + Xq where {Xi}^ =1 define an orthonormal frame satisfying the Homander condition and 
X e span{JTi}£ =1 . 

In the literature one more often finds the reverse procedure [351 [30] (see also [16] and references 
therein). One starts from a second order differential operator with smooth coefficients C which is 
symmetric and negative with respect to a volume (jl, and then looks for a distance as a function of 
which one can give estimates of the fundamental solution of dt — C. This distance is constructed 
by introducing the so-called sub-unit curves for the operator, see for instance ([HI [21]). When C is 
of the form C = Y%=\ + where {Xi} k i= l are linearly independent vector fields satisfying the 
Homander condition, the symmetry with respect to \i implies that Xq S span{Xj}k =1 . Moreover 
the distance one gets is the sub-Riemannian distance for which {Xi}^ =1 is an orthonormal frame. 

Also, let us mention that a wide literature is available about operators of the type C = 
Yli=i Xf + Xq where {Xi}^ =1 satisfy the Homander condition, but Xq ^ span{Xj}^ =1 . 



1.1 Structure of the paper 

The structure of the paper is as follows. In Section [2] we introduce the concept of sub-Riemannian 
manifold. To avoid heavy notation, we have decided to restrict ourselves to the case in which the 
dimension of the distribution does not depend on on the point. The rank- varying case is postponed 
to Appendix [Aj All the results of the paper holds also in this case. 

In Sections [3] we state and prove a result expressing the heat kernel asymptotic as a Laplace 
integral over a neighborhood of the set of midpoints of minimal geodesies (see Theorem 20). In 
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Section [4] we discuss the asymptotics of Laplace type integrals and we discuss the relation between 
the degeneracy of the hinged energy function h x , y around the midpoints the minimal geodesies 
connecting x and y and the conjugacy of the minimal geodesies connecting them (see Theorem 22). 
Then in Section [5j we get our main general result, namely the estimates on the heat kernel pt{x, y) 
as a consequence of the previous analysis (see Theorem 25 ) . 

In Section [6] we apply our general results to some relevant cases. We briefly illustrate our results 
on the Heisenberg group for which both the optimal synthesis (i.e. the set of all optimal trajectories) 
from a given point and the heat kernel are known. 

The second example is the nilpotent free (3, 6) case. In this case we get an asymptotic expansion 
on the vertical subspace (see Section 6.3), where all points are conjugate along minimal geodesies, 
which agrees with the fact that there exists a one parameter family of optimal geodesic reaching 
these points. 

Finally in Section [7] we study the heat kernel in the Grushin plane, with respect to the standard 
Lebesgue measure. The Grushin structure is the rank-varying sub-Riemannian structure on the 
plane (x, y) such that X = d x and Y = xd y define an orthonormal frame. The corresponding 
sub-Laplacian is A = X 2 + Y 2 . Starting from the Riemannian point q$ = (—1, — tt/4) we get, for 
the asymptotic at the point q\ = (1, 7r/4), which is reached from qo by a minimizing geodesic which 
is conjugate at qi, the expression Pt(qo,qi) ~ i _5//4 exp(— d 2 (qo, qi) 2 /At), computing explicitly the 
degeneration of the hinged energy function. To our knowledge this is the first time in which an 
expansion of the type t~ a exp(— d 2 (qo, q±)/4t), with a / N/2 for an integer N, is observed in the 
Riemannian or sub-Riemannian context. 



2 Sub-Riemannian geometry 

We start by recalling the definition of sub-Riemannian manifold in the case of a distribution of 
constant rank k smaller than the dimension of the space. For the more general definition of 
rank-varying sub-Riemannian structure (including as a particular case Riemannian structures) see 
Appendix [Aj 

Definition 3. A sub-Riemannian manifold is a triple (M,T>,g), where 
(i) M is a connected orientable smooth manifold of dimension n > 3; 

(ii) T> is a smooth distribution of constant rank k < n satisfying the Hormander condition, i.e. a 
smooth map that associates to q G M a /c-dimensional subspace T> q of T q M such that 

span{[Xi, [. . . [Xj^Xj]]^ \ X t € V, j € N} = T q M, Vg G M, (4) 

where T> denotes the set of horizontal smooth vector fields on M, i.e. 

V = {X £ Vec(M) | X{q) G V q V q 6 M} . 

(Hi) g q is a Riemannian metric on T> q which is smooth as function of q. We denote the norm of a 



vector v G V q by \v 
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A Lipschitz continuous curve 7 : [0, T] — > M is said to be horizontal (or admissible) if 

7(t)eD 7(1) for a.e. t E [0, T]. 
Given an horizontal curve 7 : [0, T] — > M, the length 0/7 is 

%)= [ T \j(t)\ g dt. (5) 



Notice that £(7) is invariant under time reparametrization of the curve 7. The distance induced by 
the sub-Riemannian structure on M is the function 

d(qo,qi) = inf{^(7) | 7(0) = qo,^(T) = 31,7 horizontal}. (6) 

The hypothesis of connectedness of M and the Hormander condition guarantees the finiteness 
and the continuity of d(-, •) with respect to the topology of M (Chow-Rashevsky theorem, see for 
instance [llj). The function d(-,-) is called the Carnot-Caratheodory distance and gives to M the 
structure of a metric space (see [TT]). 

Locally, the pair (T>,g) can be given by assigning a set of k smooth vector fields spanning T> 
and that are orthonormal for g, i.e. 

V q = span{Xi(g), . . .,X k (q)}, g^X^Xjiq)) = <%. (7) 

In this case, the set {X\, . . . , X^} is called a local orthonormal frame for the sub-Riemannian 
structure. 

The sub-Riemannian metric can also be expressed locally in "control form" as follows. We 
consider the control system, 

m 

q = J2 u i X M, UiGR, (8) 
i=i 

and the problem of finding the shortest curve that joins two fixed points qo, q\ £ M is naturally 
formulated as the optimal control problem 



\ 



u}{t) dt -> min, q(0) = q , q{T) =qi^q . (9) 

i=l 

Definition 4. Define V 1 := V,V l+1 := V 1 + [D l ,T>], for every % > 1. A sub-Riemannian manifold 
is said to be equiregular if for each i > 1, the dimension of T> l q does not depend on the point q E M. 
For an equiregular sub-Riemannian manifold the Hormander condition guarantees that there exists 
(a minimal) m E N, called step of the structure, such that Vl 71 = T q M, for all q E M. The sequence 

g := (dimP,dim£> 2 ,...,dim£> m ), 

k n 

is called the growth vector of the sub-Riemannian manifold. The growth vector permits us to 
compute the Hausdorff dimension of (M, d) as a metric space (see |41j ) 



in 



Q = ^2iki, hi := dimV* -dimV*- 1 . (10) 

i=l 

In particular the Hausdorff dimension is always bigger than the topological dimension of M. 

Definition 5. A sub-Riemannian manifold is said to be nilpotent if M is a nilpotent Lie group 
and the sub-Riemannian structure is left-invariant with respect to the group operation. 
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2.1 Minimizers and geodesies 

In this section we briefly recall some facts about sub-Riemannian geodesies. In particular, we define 
the sub-Riemannian exponential map. 

Definition 6. A geodesic for a sub-Riemannian manifold (M,D,g) is an admissible curve 7 : 
[0, T] — > M such that |7(i)| g is constant and, for every sufficiently small interval [ti, tz] C [0, T], the 
restriction 7| t j is a minimizer of £(■). A geodesic for which |7(t)| g = 1 is said to be parameterized 
by arclength. 

A sub-Riemannian manifold is said to be complete if (M, d) is complete as a metric space. If 
the sub-Riemannian metric is the restriction to P of a complete Riemannian metric, then it is 
complete. 

Under the assumption that the manifold is complete, a version of the Hopf-Rinow theorem (see 
P^l Chapter 2]) implies that the manifold is geodesically complete (i.e. all geodesies are defined 
for every t > 0) and that for every two points there exists a minimizing geodesic connecting them. 

Trajectories minimizing the distance between two points are solutions of first order necessary 
conditions for optimality, which in the case of sub-Riemannian geometry are given by a weak version 
of the Pontryagin Maximum Principle ( |48| ) . 

Theorem 7. Let q(-) : t G [0,T] 1— > q(t) G M be a solution of the minimization problem 
such that \q(t)\ g is constant and u(-) be the corresponding control. Then there exists a Lipschitz 
map p{) : t G [0, T] h-> p{t) G T*^M \ {0} such that one and only one of the following conditions 
holds: 

dH dH 

(i) q = -Q^, P = —Qj> < l ) = <?(<)> x Mt))), 

where H(q,p) = l^ =1 (p,X t (q)) 2 . 

&H &H. 
(H)q=^, P = —q^: 0=(p(t),Xi(q(t))), 

where H(t,q,p) = Ya=i u i(*)(P> Xi(<l))- 
For an elementary proof of Theorem [7] see [3] . 

Remark 8. If (q(-),p(-)) is a solution of (i) (resp. (ii)) then it is called a normal extremal (resp. 
abnormal extremal). It is well known that if (q(-),p(-)) is a normal extremal then g(-) is a geodesic 
(see [31 E]). This does not hold in general for abnormal extremals. An admissible trajectory 
q(-) can be at the same time normal and abnormal (corresponding to different covectors). If an 
admissible trajectory q(-) is normal but not abnormal, we say that it is strictly normal. 

Abnormal extremals are very difficult to treat and many questions are still open. For instance 
it is not known if abnormal minimizers are smooth (see 



Definition 9. A minimizer 7 : [0, T] — > M is said to be strongly normal if for every [t\, £2] C [0, T], 
7|[t 1) t 2 ] is not an abnormal minimizer. 

In the following we denote by (q(t),p(t)) = c tH {qo,Po) the solution of (i) with initial condition 
(q(0),p(0)) = (qo,Po). Moreover we denote by ir : T*M —¥ M the canonical projection. 

Normal extremals (starting from go) parametrized by arclength correspond to initial covectors 
po G A qo := {p G T* Q M\ H(q ,p ) = 1/2}. 
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Definition 10. Let (M,T>,g) be a complete sub-Riemannian manifold and go E M. We define the 
exponential map starting from go as 

£ qo : A qo x M+ -> M, ^(po,*) = vr(e tf? (g ,Po))- (H) 

Next, we recall the definition of cut and conjugate time. 

Definition 11. Let qo £ M and j(t) an arclength geodesic starting from qo. The cut time for 7 
is t«tt (7) = sup{i > 0, 7|[o,t] is optimal}. The cui locus from go is the set Cut (go) = {7(*citf(7))>7 
arclength geodesic from go}. 

Definition 12. Let go £ M and 7(4) a normal arclength geodesic starting from go with initial 
covector pq. Assume that 7 is not abnormal. The first conjugate time of 7 is t con {^) = min{t > 
0, (j>o,t) is a critical point of S qo }. The (first) conjugate locus from go is the set Con(go) = 
{7(*con(7))) 7 arclength geodesic from g }. 

It is well known that, for a geodesic 7 which is not abnormal, the cut time t* = t cu t{^) is either 
equal to the conjugate time or there exists another geodesic 7 such that j(t*) = j(t*) (see for 
instance [9]). 

Remark 13. In sub-Riemannian geometry, the exponential map starting from go is never a local 
diffeomorphism in a neighborhood of the point go itself. As a consequence the sub-Riemannian 
balls are never smooth and both the cut and the conjugate loci from go are adjacent to the point 
go itself (see |T]). 



2.2 The sub-Laplacian 

In this section we define the sub-Riemannian Laplacian on a sub-Riemannian manifold (M,T>,g), 
provided with a smooth volume \x. 

The sub-Laplacian is the natural generalization of the Laplace-Beltrami operator defined on a 
Riemannian manifold, defined as the divergence of the gradient. 

The sub-Riemannian gradient can be defined with no difficulty. On a sub-Riemannian manifold 
(M,T>,g), the gradient is the unique operator V : C°°(M) — > V defined by 



,(V<p(q),v) = d<p q (v), V<^ G C°°(M), q € M, v G V. 



By definition, the gradient is a horizontal vector field. If X±, . . . is a local orthonormal frame, 
it is easy to see that it is written as follows Vip = Yl%=i ^i{'-P)^ii where Xi(<p) denotes the Lie 
derivative of ip in the direction of X$. 

The divergence of a vector field X with respect to a volume \x is the function divX defined by 
the identity LxH- = (divX)fi, where Lx stands for the Lie derivative with respect to X. 

The sub-Laplacian associated with the sub-Riemannian structure, i.e. Acp = div(V<£>), is written 
in a local orthonormal frame X% , . . . , X^ as follows 

k 

A = Y,x! + (divX l )X i . (12) 
i=i 

Notice that A is always expressed as the sum of squares of the element of the orthonormal frame 
plus a first order term that belongs to the distribution and depends on the choice of the volume 11. 



The existence of a smooth heat kernel for the operator (12), in the case of a complete sub- 
Riemannian manifold, is stated in |53j . 
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2.2.1 Popp's volume and the instrinsic sub-Laplacian 



In this section we recall how to construct an intrinsic Laplacian (i.e. that depends only on the 
sub-Riemannian structure) in the case of an equiregular sub-Riemannian manifold. 

On a Riemannian manifold the Euclidean structure defined on the tangent space defines in a 
standard way a canonical volume: the Riemannian volume. 

In the case of an equiregular sub-Riemannian manifold (M, T>, g), even if there is no global scalar 
product defined in T q M, it is possible to define an intrinsic volume, namely the Popp volume [44J. 
This is a smooth volume on M that is defined from the properties of the Lie algebra generated by 
the family of the horizontal vector fields. In the Riemannian case this coincide with the Riemannian 
volume. 

On an equiregular manifold of dimension 3 the Popp volume is easily defined as v\ A 1/2 A z/3, 
where 1/1,1/2,1/3 is the dual basis to Xi,X2 and [Xi,X2], where {Xi,X2} is any local orthonormal 
frame for the structure. This definition happens to be independent on the choice of X±,X2- For 
the general definition see, e.g 0]. 

Notice that the Popp volume is not the unique intrinsic volume that one can build from the 
geometric structure of (M,D,g). Since a sub-Riemannian manifold is a metric space (with the 
Carnot-Caratheodory distance), one can define the Q-dimensional Hausdorff measure on M, where 



Q is defined in (10). In contrast with the Riemannian case, starting from dimension 5, the Q- 
dimensional Hausdorff measure the does not coincide in general with Popp's (see [U [T4] for details 
about these results). 

The intrinsic sub-Laplacian is defined as the sub-Laplacian where the divergence is computed 
with respect to the Popp volume. 

Remark 14. In the case of a left-invariant structure on a Lie group (and in particular for a nilpotent 
structure), the Popp volume is left-invariant, hence proportional to the left Haar measure. 

For unimodular Lie groups, and in particular for nilpotent groups, one gets for the intrinsic 
sub-Laplacian the "sum of squares" form (see [6] ) 

i=i 

Remark 15. To define the Popp volume the equiregularity assumption is crucial. In the non- 
equiregular case or in the rank- varying case the Popp volume diverges approaching the non-regular 
points, as do the coefficients of the intrinsic sub-Laplacian [6l I22j. 



3 General expression as a Laplace integral 

From now on by a sub-Riemannian manifold we mean a structure in the sense of Section [2] or 
Appendix [XJ which include as a particular case Riemannian structures. 

In the following we denote by E C M x M the set of pairs {x, y) with x 7^ y such that there 
exists a unique minimizing geodesic from x to y and such that this geodesic is strictly normal and 
not conjugate. Notice that E is an open set inMxM (see JTOj US] and O Chapter: Regularity of 
SR distance]). 

Recall that the heat kernel is the the fundamental solution of the heat equation dt<p = A(p, 
where A is the sub-Riemannian Laplacian defined with respect to some smooth volume [i on a 



9 



sub-Riemannian manifold M. We begin by recalling the asymptotic expansion of the heat kernel 
away from the cut locus, due to Ben Arous [18] (see Theorem 3.1, and adjust for the fact that our 
heat kernel is for "A" rather than "A/2"). 

Theorem 16. Let M be an n- dimensional complete sub-Riemannian manifold in the sense of 
Section^ or Appendix^A^ with a smooth volume n and associated heat kernel pt and let (x,y) 6 S. 
Then for every non-negative integer m, we have the following asymptotic expansion as t \ 0; 

Pt (x,y) = ^exp (-^J^) (f> i ( a ^ + C>r +1 ) 

\j=0 

Here the Ci are smooth functions on £ with co(x,y) > 0. Further, if K C S is a compact set, then 
the expansion is uniform over K . 

We will also need some preliminary control of the heat kernel at the cut locus, which is provided 
by a well-known result of Leandre [2D]. In particular, Theorem 1 of [38J and Theorem 2.3 of |39j 
give (again taking into account our normalization of the heat kernel) 

Theorem 17. Let M be a complete sub-Riemannian manifold with a smooth volume \i and as- 
sociated heat kernel pt- For any compact subset K of M x M , the following holds uniformly for 
(x,y) e K: 

lim4tlogpt(x,y) = -d 2 (x,y). 



Remark 18. Theorem 16 and 17 were originally stated in M. n for sub-Riemannian metrics whose 
orthonormal frame consists of vector fields which are bounded with bounded derivatives. However, 
it is not hard to see that these results hold in the more general context of complete sub-Riemannian 
structures (where closed balls are compact). 

Notation. In what follows we use sometimes the abbreviation E(x,y) = d 2 (x,y)/2 for the 
energy function. For any two distinct points x and y, we let T be the set of midpoints of minimal 
geodesies from x to y. Further, we let N(T) be a neighborhood of T, which we will feel free to 
make small enough to satisfy various assumptions. Finally, we let h X)V (z) = E{x,z) + E(z,y) be 
the hinged energy function. It's clear from the definition that h x ,y(z) is continuous. 

Lemma 19. The function h x>y attains its minimum exactly on V and mmh xy = d 2 (x,y)/4:. 

Proof. Let us consider a geodesic joining x and y and denote its midpoint by zq. We want to prove 
that h Xt y(zo) < h Xt y{z) for every z and that we have equality if and only if z is a midpoint of a 
geodesic joining x and y. 

Let a = d(x,zo) = d(zo,y), b = d(x,z), and c = d(y,z). By the triangle inequality, we have 
2a < b + c. Moreover we can assume that both b and c are less than or equal to 2a, since otherwise 
the statement is trivial. Let e > be such that 2a + e = b + c and compute 

h*M = \( b2 + c ') = \(( 2a + e " c ) 2 + c ') 

£ 2 

> a 2 + (a - c) 2 + — + e(2a - c) > a 2 = h X: y(zo) 

Moreover we have equality in the two inequalities if and only if e = and a = c, which is precisely 
the case where z is the midpoint of a geodesic joining x and y. Finally h Xt y(zo) = d 2 (x, y)/4. □ 
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We will need some basic assumptions about, and properties of, N(T). First, basic properties of 
the distance function on M imply that T is compact. Next, all of our work will take place under 
the condition that we are "away from" any abnormal geodesies. In particular, assume that x and y 
are distinct and that every minimizer from x to y is a strongly normal geodesic. While we certainly 
allow y to be in the cut locus of x (which is a symmetric arrangement), the midpoint of every 
minimal geodesic from x to y will be a positive distance from the cut loci of both x and y. 

More precisely, let A 6 T*M be a covector such that £ x (tX) for t £ [0, d(x, y)] parametrizes a 
minimal geodesic from x to y. (Here we adopt the convention that £ X {X) = it o e H (x, A).) Call this 
geodesic 7, and let zq be its midpoint. Since the cut time along 7 is at least d(x, y) and since the 
cut time is continuous as a function on T*M near A, it follows that £ x is a diffeomorphism from a 
neighborhood U of (d(x,y)/2)X to a neighborhood U' of zq = £ x ((d(x,y)/2)X). Further, assuming 
U small enough, there is a unique minimal geodesic from x to each point £ x (0 m U' given by 
£ x (s£.) for s 6 [0, 1], and this geodesic is not conjugate. In the case when £ = (d(x,y)/2)X, we have 
the "first half" of 7, from x to zq. Because 7 is strongly normal, this piece of 7 is strictly normal. 
Because the property of being strictly normal is an open condition on geodesies (see JTOj, 09] and 
[3j Chapter: Regularity of SR distance]), after possibly shrinking U and U' we have that all of the 
minimal geodesies from x to points in U' are also strictly normal. 

One consequence is that by choosing U (and thus U') small enough, the distance function from 
x is smooth on U' (which we recall is some neighborhood of Zq, the midpoint of a minimal geodesic 7 
from x to y). Another is that (after possibly further shrinking U and U') the Ben Arous expansion 
holds for pt(x, z) uniformly for z £ U' . 

Note that the discussion in the previous paragraph also holds if we reverse the roles of x and 
y. Then, since T is compact, we see that for sufficiently small neighborhood N(T) the distance 
functions from both x and y are smooth on N(T) and the Ben Arous expansion holds for both 
Pt(x, z) and pt(y, z) uniformly for z 6 N(T). It follows that h x ,y is also smooth on iV(r). These are 
the key consequences of assuming that every minimizer from x to y is a strongly normal geodesic. 
We will also occasionally take advantage of the structure of the exponential map based at either x 
or y in a neighborhood of any point z £ N(T). From now on, we will assume that, for such x and 
y, N(T) is chosen in this way. 

We now describe the main idea for determining the expansion on the cut locus. The intuition 
benefits from recalling that the heat kernel is also the transition density of Brownian motion on M. 
By the semi-group property (or the Markov property, from a stochastic point of view), a particle 
that travels from a point x to a point y 7^ x in time t first goes to some "halfway" point at time t/2, 
and then continues the rest of the way to y. For small t, a particle traveling from x to y is most 
likely to do so via a path which is approximately a geodesic (traversed at uniform speed). This is 
the usual intuition from large deviation theory. Thus, at time t/2, such a particle is likely to be near 
the midpoint of some minimal geodesic from x to y. The key insight, originally due to Molchanov 
|43j in the Riemannian case, is that, even in the case y £ Cut(x), we can choose N(T) as just 
discussed so that the expansion of Ben Arous can be applied to both the first and second halves of 
the particle's journey from x to y (at least with high probability). The expansion at the cut locus 
is thus obtained by "gluing together" two copies of the Ben Arous expansion along the midpoints 
of the minimal geodesies form x to y. Making this argument precise, using only geometric analysis 
(we use stochastic notions only to bolster our intuition in the present paper), provides the proof of 
the next theorem. (The same "gluing idea" was employed to compute asymptotics of logarithmic 
derivatives in the Riemannian case in I47j.) 
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Theorem 20. Let M be an n- dimensional complete sub-Riemannian manifold in the sense of 
Section^ or Appendix\A^ and let x and y be distinct points such that all minimal geodesies from x 
to y are strongly normal. Then for N(T) and cq as above there exists 5 > such that 



Pt(x,y) 



N(T) 



-"•■»«/* (c (x,z)c (z,y) + O{t)) fi{dz) 



-E(x,y)/2-S 



+ o exp 



Here the 0{t) term in the integral is uniform over N(T). 

Proof: By the semi-group property (or Chapman-Kolmogorov equation, for probabilists), we 
have 

Pt{x,y) 



M 



Pt/2(x,z)p t/2 (z,y) n(dz). 



We first divide M into two regions, N(T) and M\N(F). As just discussed, both pt(x, •) and pt(-, y) 
are uniformly approximated by the Ben Arous expansion on N(T) (since we assume that e > is 
sufficiently small). Using just the first term, we see that 



Pt(x,y) 



N(r) 



+ 



M\N(T) 



-h*A*yt {co{x , z)co{Ziy) + o(t)) n{dz) 

Pt/2(x,z)p t / 2 (z,y) fi(dz), 



where the 0(t) terms are uniform over N(T) by the uniformity of the Ben Arous expansion there. 
Next, we estimate the integral over M \ N(T). First, assume that M is compact. By Theorem 



17 we have that, on M, 



Pt(u,v) 



exp 



-d 2 (u,v)/2 + r(t, u,v) 
2t 



where r(t, u, v) goes to zero uniformly with t on all of M. (In the remainder of the proof, we will 
use r to denote a function with this property, the exact definition of which may change from line 
to line.) We see that 

-h x , y (z) +r(t,z)~ 



Pt/2(x,z)p t/2 (z,y) = exp 



t 



Further, the minimum of h x ^ y {z) on M\N(T) is strictly greater than h Xty (T) = E(x,y)/2. Because 
M \ N(T) has finite volume (by compactness), we see that there exists 5 > such that 



/ Pt/2(x,z)p t /2(z,y) v(dz) 

Jm\n(t) 



o yexp 



-E(x,y)/2-S 



(13) 



Next, consider the case when M is not compact. Then for large enough R, we see that x, y, 
and iV(r) are all inside of B X {R) (the ball of radius R centered around x). We split the intergal 
over M \ N(T) into an integral over Br{x) \ N{T) and an integral over M \ Br(x). The previous 
argument can be applied to the integral over Br(x)\N(T). Further, for large enough R, the integral 
over M \Br(x) is also o (exp [(— E(x,y)/2 — 5)/t\). Thus Equation (13) holds in the case when M 
is non-compact as well. Combining these estimates completes the proof. □ 
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This theorem, in principle, gives the small-time asymptotics of the heat kernel in great generality. 
To get more concrete information, one needs to be able to determine the small-time asymptotics of 
the integral over N(T). Fortunately, this is a well-studied type of integral, called a Laplace integral, 
as we shall discuss shortly. 

Finally, we have stopped with the first term of the Ben Arous expansion only for convenience. 
As much of that expansion can be kept as desired, in which case the cq(x, z)cq(z, y) + 0(t) in 
the integrand is replaced by a more general product of Taylor series. However, it is unclear how 
much additional information this really provides. It seems that relatively little is known about the 
functions cq, and higher-order coefficients in the Ben Arous expansion are even less well- understood. 
Further, including such terms means that we would also want to determine higher-order terms in 
the asymptotic behavior of the Laplace integral over N(T), which doesn't seem practical in general. 
For these reasons, we content ourselves with the leading term. 



4 Understanding the Laplace integral 



We wish to determine the asymptotics of the integral that appears in Theorem 20 To this end, 
we first review this type of integral, from which we see that the behavior of h xy near T is the key 
factor. Then we discuss the geometric meaning of the behavior of h x ,y in terms of the conjugacy 
of minimal geodesies from x to y. 

4.1 A brief discussion of Laplace asymptotics 

We now discuss techniques for determining the small t asymptotics of integrals of the type 

f(x)e- 9(x)/t dx. (14) 

D 

Here D is a compact set of W 1 having the origin in its interior, / is smooth in a neighborhood of 
D, and g is a function which is smooth in a neighborhood of D, is zero at the origin, and is strictly 
positive on D minus the origin. (We also assume the integral is with respect to Lebesgue measure; 
to treat any other measure with a smooth density we can simply incorporate the density into /.) 
Our assumption that g has zero as its minimum is no loss of generality; for any a £ M we have that 

f(x)e- {9+a)/t dx = e- a/t [ f{x)e-^ x)/t dx. 

D JD 

We start with the one-dimensional case. Further, we assume that g can be written as x 2m for 
some integer m > 1. Again, if this can be accomplished by first performing a smooth change of 
coordinates (and possibly shrinking D), we can just absorb the Jacobian into /. While it is not 
always possible to find such a change of coordinates, this is the most important case. Then (see, 
for example, [29 ) 

f{x)e- x2m ' t dx = /(0) r ( 1 /(2m)) t i/( 2m) + Q ( t 3/(2m)\ ag f ^ Q 
D m \ ) 

(here 'T" is the usual Gamma function, not the set of midpoints of minimal geodesies). We note 
that higher terms in this expansion are known, but in the present context we continue to focus only 
on the leading term. 
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The higher dimensional situation is more complicated. If we assume that g can be written as 



9(x) = 



2m, 



i=l 



for some integers 1 < mi < m<i < • • • < m n , then the expansion essentially decomposes as a product 
of one-dimensional integrals. This immediately gives 



f{x)e-^ x)/t dx = t l 



1 

2 m. , 



D 



/(o) n 



r(l/2mi) 



+ o(t 



mi 



1/m.r 



(15) 



In particular, if the Hessian of g is non-degenerate at the origin, the Morse lemma guarantees that 
we can always find coordinates near the origin in which g is a sum of squares, and thus the above 
expansion holds in these coordinates with m; = 1 for all i. However, if the Hessian is degenerate, 



it will not necessarily be true that g can be put into the form of Equation (15) by a smooth change 
of coordinates. 

Nonetheless, we recall that the "splitting lemma" for smooth functions (which can be found 
in [32]) allows us to split off non-degenerate directions and thus partially diagonalize g. In that 
spirit, the following result guarantees that, around an isolated degenerate critical point of corank 
1, there always exists a coordinate set in which g is diagonal. It is a generalization of the classical 
Morse lemma for nondegerate critical points and is a particular case of the splitting lemma just 
mentioned. 

Lemma 21. Let g be a smooth function on a neighborhood of the origin in W 1 , such that the origin 
is a local minimum of g and the only critical point of g. Assume that g(0) = dg(0) = and that 
dimker d 2 g(0) = 1. Then there exists a diffeomorphism (p from a neighborhood of the origin to a 
neighborhood of the origin and a smooth function ip : K — )■ M such that 



n-l 

g{vtu)) = + ip(u r , 

i=l 



where 



More generally, suppose that g is equal to its Taylor series near the origin. Even this doesn't 
cover all possible cases (in particular, if g is smooth but not real-analytic), but it seems to be the 
most general case for which there is a satisfactory theory. In the case where g is equal to its Taylor 
series near the origin, Arnold and his collaborators (see [12] and the references therein) have given a 
powerful analysis of the resulting asymptotics. Briefly, if g is real-analytic with a unique minimum 
of zero at the origin then the leading term in the expansion (assuming /(0) ^ 0) is of the form 
cf(0)t a \ logi| m where c is a positive constant, a is a positive rational, and m is an integer between 
and n — 1 inclusive. Estimates on a and m can be given in terms of combinatorial information 
derived from which monomials in the Taylor series of g have non-zero coefficients (more precisely, 
one looks at various features of the Newton diagram of g). Moreover, generically (in a sense which 
can be made precise) a and m are determined by this combinatorial information. 

The above assumes that g has an isolated minimum at the origin. Suppose, instead, that g 
assumes its minimum along some smooth submanifold. In this case, one can choose coordinates for 
the minimum set and then extend them to coordinates near the minimum set by adding coordinates 
for the normal bundle. Then at each point of the minimum set, one can try to apply the above 
analysis to the corresponding fiber of the normal bundle, and then attempt to integrate the result 
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over the minimum set. The simplest such case is when g is a Morse-Bott function, in which case 
the asymptotics on each fiber will be just those corresponding to a non-degenerate Hessian of the 
appropriate dimension (this is what we see, for example, for the Heisenberg group in Section [6]), 
although in general the situation can be more complicated. 

In the case when this is not possible (for example, if the minimum set has a more complicated 
structure than a submanifold) , a somewhat more general statement can be made. If g is real- 
analytic, one can use a resolution of singularities to reduce the situation to that of a sum of 
integrals of the form given in Equation (14), where in each term of the sum g is a monomial in the 
new coordinates and / is a smooth function times the absolute value of a monomial. (Essentially, 
the resolution of singularities amounts to a type of generalized change of coordinates under which 
g has this more restricted form.) The small-time asymptotics in such a case are again given by a 
rational power of t times an integer power of | log 1 1 . In contrast to the above case of an isolated 
minimum, here there does not seem to be a way of understanding the powers of t and | log t| without 
determining the resolution of singularities and computing the asymptotics of each of the resulting 
integrals. 

The interested reader is referred to the references above for complete details, or to Sections 3.5 
and 3.6 of [IB] which contain a more detailed summary of these results (and which seems too much 
of a digression to repeat here). 



4.2 Conjugacy and the behavior of h x>y 

We now discuss how the behavior of h x ,y near its minima relates to the structure of the minimal 
geodesies from x to y, specifically, to the conjugacy of these geodesies. Suppose we have distinct 
points x and y such that every minimizer from x to y is strongly normal. We begin by introducing 
notation. 

Consider any point zq G T, which corresponds to some minimal geodesic 7 from x to y. 
Then there is a unique covector A G T*M such that £ x {2\) = y and that £ x (2X,t) for t G [0,1] 
parametrizes 7. (Recall that £ x {X) = it o e H (x, A).) 

Let A(s) be a smooth curve of covectors A : (— e, e) —> T*M (for some small e > 0) such that 
A(0) = A and the derivative never vanishes. Thus A(s) is a one-parameter family of perturbations 
of A which realizes the first-order perturbation A'(0) 6 T\ (T*M). Also, we let z(s) = £ x (X(s)), so 
that z(0) = zq. Because £ x is a diffeomorphism from a neighborhood of A to a neighborhood of zq, 
we see that the derivative of z(s) also never vanishes. Thus z(s) is a curve which realizes the vector 
z'(0) G T Zo M. Further, we've established an isomorphism of the vector spaces T\ (T*M) and T Zo M 
by mapping A'(0) to z'(0), except that we've excluded the origin by insisting that both vectors are 
non-zero. 

We say that 7 is conjugate in the direction A'(0) (or with respect to the perturbation A'(0)) if 
j^£(2X(s))\ s= o = 0. Note that this only depends on A'(0). We say that the Hessian of h XtV at zq is 

degenerate in the direction z'(0) if ■j^h Xty (z(s))\ s= o = 0. This last equality is equivalent to writing 
the Hessian of h xy as a matrix in some smooth local coordinates, applying it as a quadratic form 
to ^'(0) expressed in these coordinates, and getting zero. This equivalence, as well the fact that 
whether the result is zero or not depends only on z'(0), follows from the fact that zq is a critical 
point of h x>y . 

The point of the the next theorem is that conjugacy in the direction A'(0) is equivalent to 
degeneracy in the direction z'(0). Thus the Hessian of h x ,y encodes information about the conjugacy 
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of 7, and it is a more geometric object than it might seem at first. 

Theorem 22. Let M be a complete sub-Riemannian manifold in the sense of Section^ or Appendix 
\A\ and let x and y be distinct points such that every minimal geodesic from x to y is strongly normal. 
Define T, zq 6 V, h x>y and the curves X(s),z(s) as above. Then 

(i) 7 is conjugate if and only if the Hessian of h XtV at zq is degenerate. 

(ii) In particular 7 is conjugate in the direction A'(0) if and only if the Hessian of h x ^ y at zq is 
degenerate in the corresponding direction z'(0). 

( Hi ) The dimension of the space of perturbations for which 7 is conjugate is equal to the dimension 
of the kernel of the Hessian of h x>y at zq. 

Proof. We know that there is a unique shortest geodesic from y to z(s) for all s 6 (— e, e), assuming 
e small enough. Let r](s) be the corresponding smooth curve of covectors in T*M (that is, £ y (tn(s)) 
for t £ [0, 1] parametrizes the minimal geodesic from y to z(s)). Let A(s) and f)(s) be the images 
of A and 77, respectively, under the corresponding Hamiltonian flow on the cotangent bundle. We 
see that A(0) + fj(0) = 0. 

Observe that d(E(x, -))\ z ( s ) = A(s) and d(E(y, = fj(s). (Here d stands for the differential.) 
It follows that 

dh x , y \ z{s) = \(s)+fj(s) (gt; (s) m). 

Next note that 7 is conjugate in the direction A'(0) if and only if X(s) + fj(s) = 0(s 2 ), as follows 
directly from consideration of the exponential map. Thus 7 is conjugate in the direction A'(0) if 
and only if dh Xj y\ z r s -\ = 0(s 2 ). 

We claim that dh XtV \ z r s \ is 0(s 2 ) if and only if h XjV (z(s)) is 0(s 3 ). Equivalently, the derivative 
of dh XtV (as a one- form) in the z'(0) direction is zero if and only if its pairing with z'(0) is zero. 
This relationship is most easily expressed in local coordinates. Let H be the n x n matrix for the 
Hessian of h x y at zq in some local coordinates, and let v be ^'(0) expressed in these coordinates. 
Then the derivative of dh x ^ y in the z'(0) direction is Hv , which we think of an operator on vectors 
u by {Hv, u) where is the standard Euclidean inner product for these coordinates, or equivalently 
by u 1 Hv where u and v are written as column- vectors. 

The claim now follows from the following simple fact from linear algebra: for any symmetric 
and positive semi-definite n x n real matrix A, we have that, for any x £ M n , (Ax,x) = if and 
only if Ax = G M n , where (•, •) is the standard Euclidean inner product. Because A is symmetric, 
we can find an orthonormal basis Vi, i = l,...,n for W 1 cosnisting of eigenvectors of A with 
corresponding eigenvalues Aj > 0. Then, writing x = ~Y^i=i x i v ii the above fact follows from the 
identities Ax = ^r=i an< ^ {Ax,x) = J27=i ^i x i- 

Since the Hessian of h X)V at zq is symmetric and positive semi-definite, the claim follows. Thus 
we have proven statement (ii) in the theorem (and (i) a fortiori), namely that 7 is conjugate in the 
direction A'(0) if and only if the Hessian of h X)V at zq is degenerate in the direction z'(0). 

Statement (Hi) is an immediate consequence of (ii) plus the fact that the correspondence be- 
tween A'(0) and z'(0) gives an isomorphism of vector spaces between T\ (T*M) and T Zo M, as 
discussed just before the theorem. □ 

We now briefly discuss the situation of higher-order derivatives of the exponential map and 
higher-order derivatives if h x ^ y . This situation is more complicated than what we just saw for 
lower-order derivatives. 
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Recall that £ X (2X) = y. Further, consider 



d m 

£ x (2X(s))\ s=Q . 



ds 

For m = 1, this is zero if and only if y is conjugate to x along the geodesic through zq in the 
direction of A'(0). If this first derivative is zero, then the number of higher order derivatives which 
vanish describes, in a sense, how conjugate y is to x with respect to the perturbation 2A(s). (Of 
course, it's possible for all derivatives to vanish; for example, if 2X(s) describes a one-parameter 
family of minimal geodesies from x to y, as occurs for the Heisenberg group.) We can compare the 
vanishing of these derivatives to the vanishing of the derivatives ^kh x ,y(z(s)). 

Suppose that, for some positive integer m, we have that £ x (2X(s)) = ws m + 0(s m+1 ) for some 
non-zero w £ M. n in some (smooth) system of coordinates around y (so that y is at the origin of these 
coordinates). If this holds in one such system, in holds in any other such system with w re-expressed 
in the new coordinates. Because the exponential map is a diffeomorphism from a neighborhood of 
each fj(s) 6 T*,-,M to a neighborhood of y, we see that this expansion for £ x (2A(s)) is equivalent 
to having 

X(s) + fj(s) = dh x , y \ z{s) = vs m + 0(s m+1 ) € T* {a) M 

for some non-zero one-form v written with respect to some (smooth) system of coordinates around 
zq. Again, if this holds for one such system, it holds for any other such system with v re-expressed 
relative to the new coordinates. 

Thus, the one- form dh x ,y\ z {s) vanishes to the same order as the derivatives of £ x (2X(s)). How- 
ever, when we look at h Xj y(z(s)), we see that 

s 



h x , y (z(s)) - h X) y(z ) = / dh X} y\ z (t)(z'(t))dt 



o 



(v(z'(t))t m + 0{t m+1 )) dt = —*—v(z'{0))s m+1 + 0{s m+2 ). 
v ' m + 1 



So we have that h x ^ y {z(s)) — h X: y(zo) = cs m+1 + 0(s m+2 ) for non-zero c if and only if v(z'(0)) ^ 0. 
For m = 1, it is always the case that v(z'(0)) ^ 0, as we saw in the previous theorem. However, 
for m > 1, this need not be true. In such a case we can only conclude that h X} y(z(s)) — h x ^ y {zo) = 
0(s' m+2 ), and the exact order of vanishing of the derivatives of h x ^ y is unclear in general. 

Remark 23. In the special case when M is two-dimensional and zq is an isolated minimum such that 
h Xt y vanishes to finite order at zq, the relationship is simpler. Namely, because the Hessian of h XiV is 



clearly non-degenerate along the direction of 7, we can apply Lemma 21 to write h x , y = u\ + 5(^2) 
for some coordinates U{ around zq and some smooth function g. Then if z(s) corresponds to the 
curve {u\,U2) = (0,s), we see by direct computation that v(z'(0)) / 0. In this way, the degree of 
degeneracy of the Hessian and the degree of conjugacy correspond precisely in this case. 

We also note that, at the opposite extreme, there is again a nice correspondence between the 
behavior of the exponential map and of h xy . Namely, £ x (2X(s)) = y for all s S (— e, e) if and only 
if h XtV (z(s)) = h Xt y(zo) for all s G (— e, e), as follows directly from Lemma 
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All of this seems to indicate that, loosely speaking, the more conjugate the geodesic through 
zq is, the more degenerate h x ^ y is at zq. However, it also seems that looking at curves through zq 
corresponding to one-parameter perturbations of the geodesic is too naive in general, and that a 
more sophisticated approach is needed to describe the exact relationship between the higher order 
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derivatives of the exponential map and higher order terms in the Taylor series of h xy . As we do not 
need anything beyond the results of Theorem 22 in what follows (except perhaps to give geometric 
intuition to h x>y ), we do not pursue this direction any further. (In light of the above, it seems that 



the claims about higher-order derivatives in Lemma 3.1 of [46] are over-simplified. Fortunately, in 
that paper, as in the present, only the content of Theorem 22 is used in subsequent arguments. 
The higher-order relationship serves only to provide a more geometric meaning to the behavior of 



5 General consequences of Laplace asymptotics 

We are now in a position to see what the theory of Laplace asymptotics summarized in the previous 
section gives when applied to the integral in Theorem 20 The ideas expand upon those of Section 
5.3 of [33], where inequality (18) of Theorem 25 is given in the Riemannian case. We note that the 
results we give in this section, most interestingly those which depend on whether or not x and y 
are conjugate along a given geodesic, are also valid in the Riemannian case. 

We begin with a basic lemma. For this lemma, we say that the itj are "coordinates around zq" 
if they are coordinates on some neighborhood of zq such that Ui(zo) = for all i. Inequalities for 
such coordinates are understood to hold on some such neighborhood. 



Lemma 24. Under the assumptions of Theorem 22, let z$ be any point of V . Then there exist 
smooth coordinates U\, . . . , u n around zq such that 



h XjV (ui, ...,u n )> ^d 2 (x, y) + u\. 

Also, there exist smooth coordinates v±, . . . ,v n around zq such that 

1 



(16) 



h 



x ,y{vt, . . . , v n ) < -d (x, y) + v x H h v n . 



Finally, if the geodesic from x to y passing through zq is conjugate, then the v% can be chosen so 
that 



h x Jvi, ...,v n )< -d 2 (x,y) + v\ H h v 2 l _ 1 



Proof. For z in some neighborhood of zq, let u\{z) = u\ = d{x, z) — (d(x, y)/2). If the neighborhood 
is small enough, this is a smooth function with non-vanishing derivative (since d(x, z) has these 
properties as a function of z) and ^1(^0) = 0. Thus it is a valid coordinate, and we can complete 
this to a full set of coordinates around zq. Further, the triangle inequality gives 

d(y, z) > d(x, y) - d(x, z) = \d(x, y) - u\. 



Thus we compute 



h x , y (z) = - [d(x, z) 2 + d(y, z) 2 ] 



> 



ui + -d(x, y) ) + 



d(x,y) - ui 



u\ + -d(x,y) 2 . 
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which gives the estimate (16). 

For the second estimate, recall that h x ,y is smooth and assumes its minimum at zo, and thus 
the derivative of h x ^ y vanishes at zq. It follows that for any system of coordinates Wi around zq, 
there is a small enough neighborhood of zq and positive constant C such that 

h x>y {vi, ...,v n )< ^E(x, y) + C (wj-\ h w%) 

on this neighborhood. Thus, we can simply rescale the Wi to get coordinates Uj as required. 



The proof of the final inequality is based on Lemma 21 



Because the geodesic through zq is conjugate, the Hessian of h xy at zq cannot have full rank, 
as we see from Theorem 22 First assume that the rank is exactly n — 1. Then Lemma 21 shows 
that there are coordinates u$ around zq such that 

h X)V (vi, ...,v n ) = 2 E ( X > V) + V i^ 1" v n-i + °( v i) 

Then, after possibly rescaling v n , we see that the desired inequality holds. 

Next assume the Hessian of h X)V has rank less than n — 1. Then let ip be a smooth function on 
a neighborhood of zq which is non- negative, zero at z$ (hence with vanishing derivative at zq), and 
such that h X y + <p has Hessian of rank n — 1 at zq (<p looks like a sum of squares of an appropriate 
number of coordinates, for example). Applying the previous result to h X y + <p shows that there are 
coordinates Vi around zq such that 

(h x , y + <p) (vi, ■ ■ ■ , v n ) < ^E(x, y) + v\ H h v\_ x + v n . 

Since (p is non-negative, the desired estimate for h xy follows. □ 



These estimates allow us to say more about the integral appearing in Theorem 20 



Theorem 25. With the same assumptions and notation as Theorem 20, we have that for any 
sufficiently small neighborhood N(T) 

Pt (x, y) = ! (j) e -V»W/* ( Co(x , z)co(z , y ) + (t)) n(dz). (17) 
JN(r) V*/ 

Again, the "0(t)' ! term in the integral is uniform over N(T). Also, there exist positive constants 
Ci, and to (depending on M, x, and y) such that 

Gl ;e-*M'« < Pt (x,y) < -°* e -*tov)/« (18) 



t n/2- », - t „-(l/ 2 ) 

for < t < to. Further, if x and y are conjugate along any minimal geodesic connecting them, then 
(perhaps after changing to), we have 

for < t < to- Finally, if x and y are not conjugate along any minimal geodesic joining them, then 

Pt ( X ,y) = ^^e-^>yy« (20) 
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Remark 26. Notice that Corollary [T] in the Introduction is a direct consequence of formulas (17) 
and (15). Corollary [2] contains the estimates (18), (19) and (20). 

Proof. We begin with the general bounds on pt(x,y). Choose any z$ £ T and let V C N(T) be 
some neighborhood on which there are coordinates v = (v\, . . . , v n ) as in the previous lemma (that 



is, h x y is estimated by the sum of squares of the v{). Since the integrand in Theorem 20 is positive 
for sufficiently small t, we have that 



/ (-) 



-h»,yW/t ( Co (x,z)c (z,y) + O(t)) n(dz) 



> 



E{x,y)/2t f e -{-l+-+<)/t^ X)V ) c ^ Vi y) + {t)) fiidv) 



V 



for all sufficiently small, positive t. 

Because \i is a smooth volume and v a smooth coordinate system, we know that there is a 
smooth, positive function F such that jx{dv) = F(v)dvi • • • dv n . Then the results of the previous 



section, namely Equation (15), show that 



V 



( C0 (x, v)c (v, y) + 0(t)) F(v) dvx ■ ■ ■ dv n 



,n/2 



F(0) (c (x, zo)co(z , y) + 0(t)) W 2 + 0(t) 



(where we've used that T(l/2) = -y/vr). Note that there's no difficulty handling the 0(t) in the 
integrand since we simply estimate it by |0(£)| < Ct for some positive C and factor the t out of 
the integral. Putting this together with the fact that F(0)co(x, zq)cq{zq, y) is positive, we see that 
there exist positive C\ and to such that 



JV(r) 



•■»(*)/* (c (x,z)c (z,y) + O(t)) n{dz) > ^e- E{x ' y),%t 
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we note that the o ( exp 



-E{x,y)/2-& 
t 



term is 



for < t < to- Comparing this to Theorem 
dominated by the right-hand side of the aboveTnequality. Thus, after possibly adjusting C% and 
to, we see that the relevant inequality in the theorem holds. 

For the other side of the first inequality, note that we can find coordinates U{ as in the previous 
lemma around every point of T, and each of these systems of coordinates is defined on some open 
neighborhood. Because T is compact, there is a finite set of such neighborhoods which cover T; 
denote them by Ui, . . . , U m and the corresponding systems of coordinates by Uj = (uj^i, . . . , Uj t n) 
for j = 1, . . . , m. Now choose N(T) small enough so that N(T) C U^L 1 C/j. Then we have 



/ (-) 



-V„(*)/* (co(x,z)co(z,y) + 0(t)) M (dz) 



3=1 



E( X ,y)/2t / e -uyt (co(X;Uj)co(Uj)y) + Q{t)) ll{dUj 
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for all sufficiently small, positive t. As above, ^ is a smooth volume, and Equation (15) gives that, 
for each j, there is a positive constant Kj such that 

f e~ u h^ (co(x, Uj )co( Uj ,y) + 0{t)) n(d Uj ) = y/t (Kj + 0(t)) . 

Summing j from 1 to m allows us to conclude that there exists positive C2 such that, after possibly 
making to smaller, 



(| v.- 



h ^» t (c (x,z)co(z,y) + 0(t)) fx(dz) < -^e"^)/ 2 * 



for < t < to- Again, comparing this to Theorem 20 we see that the other side of the first 
inequality in the theorem holds, after possibly adjusting C2 and to- 



The two-sided inequality we've just proved now shows that the term o (exp 



-E(x,y)/2-8 
t 
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Theorem 20 is unnecessary; it can be "included" in the 0(t) term in the integral (as we've already 
taken advantage of above). This establishes the first claim in the theorem. 

Now we consider the case when x and y are conjugate along some minimal geodesic. Suppose 
that zq is the midpoint of this geodesic. Then we can find coordinates Vi around Zq, defined on 
some neighborhood V C N(T), such that 



hx 7 y(vi, ...,v n )< 2 E ( X > y) + v i 



+ v n _ ± + v n . 



Analogous to the previous lower bound, we have that 

'e-W*)/* ( Co (x,z)c (z,y) + O(t)) »(dz) 

>l^ eT^fir 1 j e-^ + - +v "-^ +v ") /t (c ( y x,v)c (v,y) + O(t)) fi(dv), 



N(F) 



for all sufficiently small, positive t. Equation (15) (along with smoothness of fi and positivity of 
the cq) then shows that, for some positive constants C3 and to (possibly different from before), 



> t (n/2)- ( i/4) (c 3+0 (yi)), 



for < t < to- Combining these estimates and the first claim in the theorem, we see that, after 
possibly adjusting C3 and to, 



Pt(x,y) > 



C 3 



t (n/2)+(l/4) 



-E(x,y)/2t 



for < t < to- 

Finally, we suppose that x and y are not conjugate along any minimal geodesic joining them. 
Then for any zq £ T, Theorem 22 and the Morse lemma imply that zq is isolated. Since T is 
compact, we see that in fact T consists of finitely many points, say z±, . . . ,z m (so there are only 
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finitely many minimal geodesies from x toy). Further, W6 CcLn find, coordinates Uj^i^ • • • > ^/,n 

around 

each Zj, on some neighborhood Uj, such that 



h x , y (uj t i, iij, n ) = 2 E ( X ' y) + n i,i ^ u \n on Uj, 

and N(T) is the disjoint union of the Uj (for small enough Uj). Thus, using the first claim in the 
theorem, 



3=1 

x co(x,Uj)c (uj,y) + O(i) fi(duj). (21) 
We have that jx{duj) = Fj(uj)duj^\ ■ ■ ■ duj tU for smooth, positive Fj. As above, we compute 

-(«?,!+•••+"£»)/* ( Co (x, Uj )co(uj,y) + 0(t)) • • • dn iin 

= t n ' 2 [Fj(0) (c (x, Zj)co(zj,y) + 0(t)) W 2 + 0(t) 
Summing over j, we have 

where C 4 = (4^) n/2 £™ =1 Fj(0)co (x, Zj)co(zj,y), which is clearly positive. □ 

One consequence of this result is that the exponent of 1/t in the small-time expansion of 
Pt(x,y) "sees" whether or not x and y are conjugate along any minimal geodesic. Said differently, 
the exponent of t detects the part of the cut locus of x which comes from conjugacy (assuming 
that the necessary geodesies are strictly normal, of course). That naturally leads to the question 
of what happens at cut points which are not conjugate. 

We first note that, if y is not in the cut locus of x, then the results of this analysis fit nicely 
with the expansion of Ben Arous, which applies in a neighborhood of y. In this case, there is a 
single minimal geodesic from x to y and it is not conjugate. Let z\ be the midpoint. Then the 
same analysis as in the last part of the previous proof (just with m = 1) shows that 

p t (x,y) = (F(z 1 ) Co (x,z 1 ) C o(z 1 ,y) + 0(t))^^ e -^)/ 2 *, 

where F(z\) is the density of \i with respect to coordinates which make the Hessian of h x>y at Z\ 
the identity matrix. Since the Ben Arous expansion applies to pt{x, y), we also have 

p t (x,y) = (c (x,y) + O(t)) -^e~ E ^/ 2t . 



So in this case, Theorem 25 provides a relationship between cq(x, y) on the one hand, and cq(x, zi), 
co(zi,y), and second-order behavior of h x>y at z\ (which is encoded by F{z\)) on the other. 

Now suppose that y is in the cut locus of x, but that none of the minimal geodesies from x 
to y are conjugate (and the assumptions of Theorem 25 hold, of course). Let 71 (s) be one such 
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geodesic, parametrized by arc-length so that 71 (0) = x and ji(d(x,y)) = y. Then we claim that 
lim s/ *^( x co(x, 71 (s)) exists and is positive, and we denote it u\. This follows from the relationship 
between co(x,7i(s)) and co(x, 7i(s/2)), co(7i(s/2), y), and F(7i(s/2) just discussed, and that fact 
that these last three quantities are continuous in s and remain positive. (Indeed, we've already seen 
in the proof of Theorem 25 that a\ = (47r) n ' /2 F{z\)cq(x, z\) co(zi,y) where z\ = ji(d(x, y)/2).) 
Alternatively, one can think of lifting a neighborhood of 71 ([0, s]) to a "local" universal cover and 
then applying the Ben Arous expansion. 

Continuing, we let 72, ... , 7 m be the other minimal geodesies from x to y, where we know that 
there can only be finitely many and that m must be at least 2. We let 02, ■ ■ ■ , ct m be the associated 
limits of co(x, •) along these geodesies, analogous to a%. Then the final part of the proof of Theorem 
[25] shows that 



Pt(x,y) 



in 



+ 0(t) 



t n/2 



-E(x,y)/2t 



The point of relating the coefficient of t^ n l 2 e~ d2<yX ^l u in the above to the cq along the jj is 
that we see that this coefficient is discontinuous at y. That is, for any 7^, we know that cq(x, 'jj(s)) 
is continuous in a neighborhood of 7j(s) as long as < s < d(x, y). However, when s increases to 
d(x, y), the value of this coefficient "jumps up" to the sum of the ay. Thus, points which are not in 
the cut locus of x and points that are but are not conjugate to x along any minimal geodesies both 
have small-time heat kernel expansions that look like a constant times t~ n ^ 2 e~ d2 ^ x ' y ^ 4:t . These two 
types of points can be distinguished by whether or not the coefficient (the constant) is continuous 
at the point in question. However, if one has that much information about the small-time heat 
kernel asymptotics in a neighborhood of a point y, then presumably one already understands d(x, •) 
near y, from which one should be able to understand the local structure of the cut locus. Thus 
looking at this coefficient, from the perspective of locating the cut locus, seems unlikely to be of 
much help. 

This potentially stands in contrast to the case when y is conjugate to x along a minimal geodesic, 
in which case only the power of t appearing in the expansion at the point y needs to be determined 
(in order to conclude that y is conjugate to x along a minimal geodesic). 



6 Examples 

In this section we discuss our results in some examples of 2-step sub-Riemannian structures. In 
these cases, an integral expression of the heat kernel (which can be explicitly written in some cases) 
has been found in [T7] , 

In the first example, namely the Heisenberg group, we briefly compute the Hessian of the hinged 
energy function h xy when x is the origin and y is a point on the cut locus. In this case, being that 
both the optimal synthesis and the heat kernel known explicitly, we verify the results of Theorem 

m 

The second example is the free nilpotent sub-Riemannian structure with growth vector (3,6). 
Here we use a "reverse" argument, starting from the formula for the heat kernel to find the asymp- 
totics for points belonging to the vertical subspace, where all points are both cut and conjugate. 
This asymptotic agrees with the fact there exists a one parameter family of optimal geodesies that 
reach this point (for a detailed discussion about the optimal synthesis see |45j). 
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In this section the heat kernel is meant for the intrinsic sub-Laplacian, i.e. it is computed with 
respect to the Popp volume. For the cases treated in this section this volume is proportional to 
the left Haar measure and is proportional to the Lebesgue measure in the standard system of 
coordinates we are using. 



6.1 Formula for the heat kernel in the 2-step case 

In this section we recall the expression of the heat kernel of the intrinsic sub-Laplacian associated 
with a 2-step nilpotent structure, that has been found in [17J. Then we rewrite it to have a 
convenient expression on the "vertical subspace" . 

Consider on W 1 a 2-step nilpotent structure of rank k < n, where Xi, . . . , X^ is an orthonormal 
frame. Once a smooth complement V for the distribution is chosen (i.e. TqW 1 = T> q © V q , for 
all q G 1") we can complete an orthonormal frame to a global one X\, . . . , X^, Y\,..., Y m , where 
m = n — k and V q = span„{li, . . . , Y m }. Since the structure is nilpotent, we can assume that the 
only nontrivial commutation relations are 

m 

[X l ,X J ] = Y J bt 3 Y h , (22) 

h=l 

where B\, ... , B m defined by = (6^) are skew-symmetric matrices (see [H] for the role of these 
matrices in the exponential map). 

Due to the group structure, the intrinsic sub-Laplacian takes the form of sum of squares A 



Yll=i x i ( see Remark 14). The group structure also implies that the heat kernel is invariant with 
respect to the group operation hence it is enough to consider the heat kernel pt(0, q) starting from 
the identity of the group, which we also denote pt(q)- The heat kernel is written as follows (see 
again [UJEI]) 

f\ 2 f WRf » ( W{B{t))x-x \ (z-t\ 
Pti<l) = / V ( B ( T )) exp cos — — dr, 



where q = (x,z), x G R k ,z £ M m , and B(t) := J2T=i T i B i- Moreover V : M nxn -> C and 
W : M raxn — > M nxri are the matrix functions defined by 



V(A) = Wdet I -A-r\ W{A) S 



s sin A J tan A 

Here Q is the Hausdorff dimension of the sub-Riemannian structure. 



Notice that ( 23 ) differs by some constant factors from the formulas contained in [T7] since there 
the heat kernel is the solution of the equation dt = |A. 

Remark 27. Assume that the real skew-symmetric matrix B{t) is diagonalizable and denote by 
±i\j(r), for j = 1, . . .£, its non zero eigenvalues. Then we have the formula for the expansion on 
the "vertical subspace" (i.e. where x = 0) 

p t ((0, z)) = - \ nl0 [ ff r cos ( —) dr. (23) 

Fny ' (4vrt)Q/ 2 7 Rm y sinhA^r) V t J V ' 
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6.2 The Heisenberg group 

The Heisenberg group is the simplest example of sub-Riemannian manifold. It is defined by the 
orthonormal frame T> = span{Xi, X 2 } on M 3 (with coordinates (x, y, z)) defined by 



X\ = d x - X 2 = d y + |<9 2 . 



Defining Z = d z , we have the commutation relations [Xi,X2] = Z and [X\,Z] = [X2,Z] = 0. 
Denote by £q : Ao x M + — > M the exponential map starting from the origin, where 

A = {po = (0, w) e T *M 1 e 5 1 , loeR}. 

For every po = G Ao with 7^ 0, the arclength geodesic j(t) = £o(po,t) = (x(t),y(t), z(t)) 

associated with the initial covector po is described by the equations 

x(t) = —(cos(wt + 0) — cos 0), 
w 

y ( t) = -(sm( wt + 0) -sin0), (24) 
w 

z (t) = — o{wt — sin wt). 
2w z 

and is optimal up to its cut time t cu t = 2n/w, with ^{t cu t) = (0, 0, ir/w 2 ). If w = 0, the geodesic is 
a straight line contained in the xy-plane and t cut = +00. 

From these properties it follows that the cut locus starting from the origin coincides with the 
z-axis, and for every point £ = (0,0, z) in this set we have d 2 (0,Q = 4k\z\. 

Remark 28. The expression of the heat kernel pt(q) for the Heisenberg group is well known and 
was first computed by Gaveau |31j and Hulanicki |34j . The integral formula for pt can be directly 



recovered from (23) since in this case there is a single skew-symmetric matrix B 

S =(_°1 J)> eig(B(r)) = {±ir}. 

Hence it follows 

, , 2 f°° T ( x 2 + y 2 T \ fZT\ , 

Pt( °' q) = (4^F }_„ sfimT 6XP { 4^t^nh7j C ° S vTJ ir ' 

On the vertical axis the integral can be explicitly computed 

2 f°° T (zt\ 1 1 

m°i C) = 7^— v? / ^"j— cos dr 



(47rt) 2 y,.^ sinhT U/ 8£ 2 1 + cosh {Tf 



Hence, using that d 2 (0,C) = 4ttz we have 

W (0,C) = ^exp = ^exp ("^^) iW), (25) 

where ^(i) is a smooth function of i, nonvanishing at 0. (Here z is fixed.) 
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In what follows, we recover the expansion (25) computing the expansion of the hinged energy 



function and applying Corollary [T] For reasons of symmetry it is not restrictive to consider only 
points C = (0, 0, z) such that z > (the on-diagonal expansion is a different situation). 

The set of minimal geodesies joining to ( = (0, 0, z) is parametrized by the covectors po = (9, w) 
where 9 G S , z = tt/w 2 . For each po, the associated geodesic 7 P0 satisfies 7 P0 (0) = and 
7 P0 (27r/w)) = £. Further, we have that the set of midpoints T is characterized as follows 



r = £ (S 1 ,w, 



TT 



It' 



x,y, 



2 i 2 

x + y 



2/w 



We introduce cylindrical coordinates (p,ip,z), where x = p cos <p, y = psinip. We have that 
(t, 9, w) forms a smooth coordinate system on N(T) (for N(T) small), where t represents the distance 
from the origin. Because of the invariance with respect to rotation around the z axis, to compute 
the Hessian of the hinged energy function h £ we are left to study the relationship between (p, z) 
and (t, w) near T. We have 



p(t,w) 



sm 



w 



wt\ 



and z(t, w) 



2w 2 



(wt — sin wt) 



Recall that 



VC^' ^ = 9 ( d (°> ^' Z ^ + d W> 



Using that t respresents the distance from the origin and exchanging the role of and £, one 



can get with some implicit differentiation for the matrix element of the Hessian of h 



: C 



dz- 



2w 2 , 



dp- 



■h Q ^(p,z) 



7T 

~2 



Wp~dz h ^ z) 



0. 



It follows that there exists a smooth change of coordinates (p, z) i— )■ (u, v) on a small disk 
perpendicular to T (with respect to the the usual M 3 metric) with the following three properties. 
First, T corresponds to the set where u and v are both zero. Second, h Q ^(u, v) 
N(T). Third, du = \dp on T and dv = wdz on T. Applying Theorem 
the constants one gets 
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+ u + v on 



and keeping track of all 



Pt(0,() 



1 

t 2 



exp 



tt 2 /w 2 \ / 487r(c (0,r))' 



+ 0(t) 



where co(0,r) is the constant apearing in the Ben Arous expansion. Taking into account that 
z = tt/w 2 , the heat kernel decays like a constant times t~ 2 exp(— 47rz/4t), which agrees with what 



one obtains from equation (25). 



6.3 (3,6) case 

The free nilpotent Lie group (3, 6) is the sub-Riemannian structure on M 6 (with coordinates 
(xi,X2,xs, zi, Z2, Z3)) defined by the distribution V = span{Ai, X2, A3}, where the vector fields 

X\ = d Xl - \x 2 d zz + \xzd Z2 , 
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X 2 = d X2 + ]^xid Zi - -X3d Zl , 



X3 = d X3 + -x 2 d Zl - -xid Z2 , 



define an orthonormal frame. If we set Zj = d Zi for i = 1, 2, 3 we have [X±, X 2 ] = Z$, [X2, X3] = Z%, 
and [X 3 ,X 1 ] = Z 2 . 



In this case the matrices B^ = (6^ ) defined by the identities pQ,Xj] = b^Y^ are 



B x = 1 , B 2 = 0, B, 






and for their linear combination B(r) = ^2j = iTjBj we have eig(5(r)) = {0,±i|r|}, where we 
denote by | • | the standard norm on IR 3 . 

Using (23) the explicit expression on the "vertical" subspace, i.e. at a point ( = (0, 0, 0, zi,z 2 , 23) 
is written as follows 

^-(S^jL-sh™^)*' (26) 



To compute the expansion of the heat kernel for M we use the fact that (26) is the Fourier 
transform of the radial function f(r) = |r|/ sinh |r|. 

Recall that, if F{x) = f{\x\) is a radial function defined on M m , its Fourier transform F{£) is 
itself a radial function, i.e. it is defined by F(£) = g(\£\), where g is the function of one variable 
that satisfies 

(n \m/2 r-ao 

9{P) = { -4eT- / Jn^(T P )T m/2 f(T)dT, p=\£\, 

P 2 JO 2 



and J denotes the Bessel function. In our case m = 3, we have J\/ 2 (s) = y ~ sins and 

9 (P) = 4vr r ^/(r)r 2 dr, 

JO PT 

Then we can rewrite our heat kernel as the 1-dimensional integral 



Using that 



8tt f°° t 2 sin pr \z\ 
(A-Ktyi 1 J Q p smhr t 



r 2 sin pr 2vr 3 sinh 4 (f) 

r -dr = : — 5 . . , p € 



psinhr psinh 3 (7rp) 
we can explicitly write the expression of the heat kernel for £ such that \z\ = 1 

n (n - sinh4 (i) ^ 



From (27) one can immediately show that for such £ 



limi 7/2 e?p t (C) = C > 0. (28) 
t-s-0 



The following lemma is a direct consequence of Theorem 17 
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Lemma 29. Assume that there exist a,K > 0, to > and constants C\,Ci > such that 



^ e -%<pt(x,y)<Qe-%, V0 < t < t . (29) 



Then K = d 2 (x,y). 



Proof. Since log is a monotone function, we can apply 4ilog both inequalities in (29) and letting 
t — > + (which is allowed since the estimate is uniform for small t) we have lim i _ s> o+ 4£ \ogpt{x, y) = 
—K and the statement follows from Theorem 1171 □ 



Proposition 30. Let £ = (0, 0, 0, z\, Z2, £3) with \z\ = 1. T/ien d 2 (0, C) = 47r and the following 
asymptotic expansion holds 

»<o = ^«p(-^)*>. 

where <p(i) is a smooth function nonvanishing at t = 0. Moreover £ is a conjugate point. 



Proof. This follows directly from (28), Lemma 29 and Corollary 2 □ 



Remark 31. From this analysis of the heat kernel and the homogeneity of the distance one gets 
the following information: (i). cZ 2 (0, C) = 47r|z| for every ( = (0,0,0,21,^2,2:3). (ii). The point £ is 
reached from the origin by an optimal geodesic that at time t = Ak\z\ is also conjugate. 

These facts were proved in [35] with a detailed analysis of the exponential map. (Notice that 
by symmetry is not difficult to prove that the point is conjugate to the origin along the geodesic. 
On the contrary, the difficulty is in proving that the geodesic does not lose optimality before the 
conjugate locus.) Our method via the analysis of the heat kernel provides a shorter proof. 



7 Grushin plane 

The Grushin plane is the generalized sub-Riemannian structure on M 2 for which an orthonormal 
frame of vector fields is given by 

X = d x , Y = xd y . (30) 

Since Y vanishes on the y-axis, this is a rank-varying sub-Riemannian structure and in particular 
is a 2-dimensional almost-Riemannian structure (see Appendix) . One immediately verifies that the 
Lie bracket generating condition is satisfied since [X, Y] = d y . 

In this section we compute the expansion of the heat kernel in the Grushin plane at a conjugate 
point, starting from a Riemannian point. 

The interesting feature of this structure is that it provides an example of almost Riemannian 
geometry in which the geodesic flow is completely integrable by means of trigonometric functions 
and, at the same time, the conjugate locus has the same structure of the conjugate locus of a 
generic 2-dimensional Riemannian metric. 



The sub-Riemannian Hamiltonian associated with the orthonormal frame (30) (in standard 
coordinates A = (p x ,Py, x,y) in T*M 2 ) is the smooth function 



H : T*R A — > R, H(p x ,p y ,x,y) = h P 2 x + x 2 p 2 y)- (31) 
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Since in this case there are no abormal minimizers (see [7j) the arclength geodesic flow starting 
from the Riemannian point qo = (— 1, — 7r/4) is computed as the solution of the Hamiltonian system 
associated with H, with initial condition (xo,yo) = ( — 1> — t/4) and (p x (0),p y (0)) = (cos#, sin#), 
where 6 S S 1 . The exponential map £ : M+ x S 1 — > M 2 starting from qo, is computed as follows 
(we omit the base point qo in the notation) 



£(t,e) = (x(t,0) ) y(t,O)), 



x(t,6) 

y(t,o) 



sin 



t sin( 



smtf 

7T 1 

4 4 sin ( 



(32) 



2t - 2 cos 9 + 



sin(26» - 2tsin<9) 



8111 ( 



with the understanding £ (t, 0) = limg_ s> o , ^(^ 



(t-l,-7T/4). 




Figure 1: Geodesies starting from go 



Let us consider the point q\ = (l,7r/4), the symmetric of qo with respect to the origin. The 
point qi is both a cut and a conjugate point from qo. Indeed from the results of [7] immediately 
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follows that the cut locus from go is the set Cut (go) = { (1 , vr/4 + s), s > 0}. Moreover 

£(M) = (0,0), (33) 

0=it/2 

shows that q± is also conjugate to go- Figure [I] shows some geodesies starting from the point go and 
the endpoints of all geodesies starting from go at time T = 4.8. 

Remark 32. Notice that the geodesic with initial covector 8 = tt/2 is the only one that reach gi 
optimally in time T = tt/2. The midpoint of the geodesic is the origin £ (7r/4,7r/2) = (0,0). (See 
also Figure [2j) 




Figure 2: The conjugate geodesic 



£(7r,7r/2) = gi, 



d 
d6 



We are interested in the small time asymptotic expansion of Pt(Qo,Qi), where pt denotes the 
heat kernel of the sub-Riemannian heat equation 

d t <p = Aip, A = X 2 + Y 2 = d 2 x + x 2 d 2 . 

Here the sub-Laplacian is not the intrinsic one but is computed with respect to the standard 
Lebegue measure of M. 2 . Indeed in this case the intrinsic volume \i = A dxdy is diverging along the 
singular set Z = {x = 0} hence our results does not apply since fx is not smooth. (See |22j for a 
discussion of the intrinsic heat equation in the Grushin plane.) 

An integral representation for the heat kernel for the operator d 2 + x 2 d 2 can be easily obtained 
by computing the Fourier transform with respect to the y variable and then using the Mehler kernel 
for the quantum harmonic oscillator. Its expression, given q = (x, y), q' = (a/, y') is 

. 1 f°° ( XX 1 T (x 2 + X' 2 ) T \ 

pm,q) = yz ^o /9 / exp 



(2vrt) 3 / 2 y^oo \ t sinh(2r) 2t tanh(2r) 

i(y-y')r\ 
exp dr. 



sinh(2r) \ t 

However from this formula it seems hard to find an asymptotic expansion for t small except on the 
diagonal at the origin. 
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Thanks to Corollary [TJ to compute the asymptotic expansion of the heat kernel Pt{Qo,Qi) we 
are reduced to study the expansion of the hinged energy function hq 0>qi near the origin (we omit 
the points in the notation in what follows) 

H x , V) = o ( rf2 (?0, (x, y)) + d 2 (qi, (x, y))) 



(d 2 (q , (x, y)) + d 2 (q , (-x, -y))) 



where the last identity follows from the symmetries of the structure and implies that the expansion 
of h at the origin contains only even order terms in (x, y) and we are reduced to compute the even 
terms of the expansion of the function (x,y) i-> d 2 (qo, (x,y)). 



Remark 33. By (33) and the proof of Theorem 22 it follows that the Hessian of the hinged energy 
function h is degenerate along the direction (—1,1) since 

£(^2,0) = (-1,1). 



d 
d9 



0=tt/2 



For this reason we consider the new coordinate system (x, y) around defined by x — 
The Hessian of h is diagonal in these coordinates. 



2 ' 



Using the fact that the geodesies defined by (32) are parametrized by arclength, we can com- 
pute the derivatives of the distance with respect to (x,y) by computing derivatives of t from (32) 
with implicit differentiation (as in Section 6.2). After some computations one finds the following 
expansion for h (we omit the bar in x, y for the new system of coordinates) 



t i \ a 2 , a ~ 32 4 

n(x,y) = 4:X H — x xy 

y ,y ' 24 6 



a - 32 



2„.2 



-x y 



(34) 



- X y3 + -y* + 0(\\(x,y)\f 



3^2 
2' 



Concerning our hinged energy function ( 34 ) one can also show that the following explicit change 
of coordinates 



<p(u,v) = (u + 



32 - a 



a 



a 



-u A v A u v + 

192 48 48 



16 — a 



32 



uv 2 , V ). 



diagonalizes h up to order 5. Namely 



a 



h{ V (u,v))=to' + -tf + 0{\\(v,v)\n 

A direct application of Corollary [l] (recall also Lemma 21 ), together with d(qo, q\) = 7r/2, gives 
Theorem 34. The heat kernel pt(qo,Qi) satisfies the following asymptotic expansion 

1 



Pt(qo,qi) 



e -m(C + 0(t)). 



(35) 



Remark 35. Notice that the same expansion as in (35) holds for the symmetric point qi 
(1,-3tt/4). If 9 {91,92} 

1 d 2 (q ,q) 

Pt{qo,q)~-e « (C + 0(t)). 
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Remark 36. Corollary [2] can be applied to compute the heat kernel asymptotics starting from the 
origin. In this case the cut locus is the y axes and these points are not conjugate. On the diagonal, 
applying the Leandre - Ben Arous result ([!]) with Q = 3 (or using the explicit formula for the 
heat kernel given above), one gets pt((0,0),(0,0)) ~ C/t 3 / 2 with C > 0. Off diagonal, applying 
Corollary [2j one gets Pt((0, 0), (x, y)) ~ C(x, y)/t for some C(x, y) > 0. 

The expansion of the heat kernel for the Grushin plane is summarized in the following table: 





diagonal 
(Leandre) 
(Ben Arous) 


off diagonal 
off cut locus 
(Ben Arous) 


off diagonal 
cut (non-conjugate) 
(Corollary 1) 


off diagonal 
cut conjugate 
(Corollary 2) 


Pt(q,q') 
q Riemannian point 


~ fi 
t 


_ C -d 2 (q,q')/(U) 
t c 


_ C -d 2 (q,q')/(4t) 
t C 


_ C -d 2 (q,q')/(4t) 
t 5/4 e 


Pt(q,q') 
q degenerate point 


c 


^ C -d?{q,q')/(U) 
t C 


„ C -d 2 (q,q')/(4t) 
t C 





A Extension to rank-varying sub-Riemannian structures 

In this section we give a more general definition of sub-Riemannian manifold (that we call rank- 
varying sub-Riemannian manifold). This definition includes also as a particular case Riemannian 
manifolds. For a more complete presentation one can see pjj. All the results of the paper hold for 
this more general structure. 

Let M be an n-dimensional smooth manifold. Given a vector bundle U over M, the C°°(M)- 
module of smooth sections of U is denoted by T(U). For the particular case U = TM, the set of 
smooth vector fields on M is denoted by Vec(M). 

Definition 37. An (n, k) -rank-varying distribution on an n-dimensional manifold M is a pair (U, /) 
where U is a vector bundle of rank k over M and / : U — > TM is a morphism of vector bundles, 
i.e. (i) the diagram 

Y 

M 

commutes, where tt : TM — > M and ttjj : U — > M denote the canonical projections and (ii) / is 
linear on fibers. Moreover, we require the map a \— > f o a from T(U) to Vec(M) to be injective. 

Let (U, /) be an (n, /c)-rank-varying distribution, A = {foa\a£ r(U)} be its associated 
submodule and denote by A q the linear subspace {V(q) \ V £ A} = f(U q ) Q T q M. Let Lie(A) 
be the smallest Lie subalgebra of Vec(M) containing A and, for every q S M, let Lie g (A) be the 
linear subspace of T q M whose elements are the evaluation at q of elements belonging to Lie(A). 
We say that (U, /) satisfies the Hormander condition if Lie g (A) = T q M for every q £ M. 

Definition 38. An (n,k) -rank-varying sub-Riemannian structure is a triple (U, /,(•,•)) where 
(U, /) is a Lie bracket generating (n, A;)-rank-varying distribution on a manifold M and (•, -) q is a 
scalar product on JJ q smoothly depending on q. 
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Several classical structures can be seen as particular cases of rank-varying sub-Riemannian 
structures, e.g., Riemannian structures (when U = TM and / = id) and constant-rank sub- 
Riemannian structures (as denned in Section [2]). An (n, n)-rank- varying sub-Riemannian structure 
is called an n- dimensional almost- Riemannian structure. An example of 2-almost Riemannian 
structure is provided by the Grushin plane, see [7J ?] . 

If 0"i, cifc is an orthonormal frame for (•, •) on an open subset 0, of M, an orthonormal frame 
in Q for the rank- varying sub-Riemannian structure is given by X\, . . . ,Xk, where Xj := /ojj. 
Orthonormal frames are systems of local generators of A. For every q E M and every v £ A q define 

G q (v) = in£{{u, u) q \u£ JJ q , f(u) = v}. 

Notice that if X\ , . . . , A& is an orthonormal frame for the rank- varying sub-Riemannian structure 
in 0, then it may happen that there exist ag£(] such that dimspan{Ai(g), . . . , A/%(g)} < k and 
that Gq(Xi(q)) < 1 for some i. 

A Lipschitz continuous curve 7 : [0, T] — > M is said to be horizontal (or admissible) if there 
exists a measurable essentially bounded function 

[0,T]3t^u(t)eV j(t) , 

called control function, such that j(t) = f(u(t)) for almost every t £ [0,T]. Given an admissible 
curve 7 : [0, T] — > M, the length 0/7 is 

£(7) = £ y/G y(t) m))dt. 

The Carnot-Caratheodory distance is defined as 

d(qo,qi) = inf{^(7) | 7(0) = q ,7(T) = 91,7 admissible}. 

As in the classical sub-Riemannian case, the hypothesis of connectedness of M and the Hormander 
condition guarantees the finiteness and the continuity of d(-, •) with respect to the topology of M. 

For rank-varying sub-Riemannian structures the definitions of minimizers, geodesies, normal 
and abnormal extremals and the formulation of the Pontryagin Maximum Principle are the same 
as in the constant rank case. Also the definition of cut and conjugate loci are the same. Thanks 
to the injectivity assumption, the definition of the horizontal gradient is still V99 = Yli=i Xi((p)Xi. 
The definition of the Popp's volume is instead more delicate, since the volume diverges while 
approaching a point in which there is a drop of rank of the distribution. However, for a smooth 
volume \i the sub-Laplacian still has the form A = X^i=i %f + (divAj)Xj, and all the results of the 
paper hold in this case. 

Acknowledgements. The authors would like to thank Fabrice Baudoin and Andrei Agrachev 
for helpful discussions. The authors also thank IHP for its hospitality during the finishing of this 
paper. 
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